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Abstract 

Let N, Ni and N2 be point processes such that A^i is obtained from N by homogeneous 
independent thinning and = N — Ni. We give a new elementary proof that A^i and A''2 
are independent if and only if is a Poisson point process. We present some applications 
of this result to test if a homogeneous point process is a Poisson point process. 
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1 Introduction 



In spatial statistics, it is common to consider simultaneously two spatial point patterns. A 
common applied setting is that where the researcher considers a pattern composed by the 
location of disease cases in a planar region, and another set of locations labeled as control 
individuals. Generally, the location of case and control individuals are their residences. The 
attention in the first situation is concentrated on the comparison of the marginal distributions 
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of the two processes. Usually, the interest is to decide if the disease cases have some degree 
of spatial clustering with respect to the controls' pattern (Diggle, 1993; Kelsall and Diggle, 
1995), specially around putative sources of increased risk (Diggle, 1990; Diggle and Rowlingson, 
1994). If cases and controls exhibit the same spatial pattern, it makes sense to consider the null 
hypothesis that cases and controls are independent random samples from the same population 
at risk. This hypothesis implies that, conditionally on the observed locations of cases and 
controls, the events are labeled by the random outcome of fliping a coin of constant probability 
p, where p reflects the relative sizes of the cases and controls samples. It is usual to carry out 
the test conditioned on the observed number of cases of controls. 

Another common situation in spatial statistics is when the interest concentrates on testing 
the independence of two point patterns and therefore attention is directed to the joint distri- 
bution of the processes. For example, the researcher could be studying two species of plants in 
the same region. Prom theoretical reasons or empirical knowledge, the species could be known 
to have quite different spatial configurations. Therefore, there would be no interest in testing 
if they arise by randomly labeling an original process. In this situation, it is more usual to 
test either they arc independent point processes or, alternatively, if there is interaction between 
the two processes (Lotwick and Silverman, 1982; Wiegand et al., 2000). If the independence 
hypothesis holds, the expected number of individuals from one species in a disc centered at 
X — {xi, X2) is independent of the presence in x of an individual from the other species. 

Hence, the two hypothesis are considered in very different situations and they imply different 
consequences to the observed point patterns. However, these two hypothesis are not exclusive. 
Suppose that N is a Poisson process with intensity function X{x) and that N = Ni + N2 where 
Ni is a thinning of obtained through the function p{x) = p, a constant independent of A^. 
That is, A^i is a random labeling of the A^ events. Then, it is well known that A^i and N2 are 
independent Poisson processes with intensities p \{x) and (1 — p) \{x), respectively (Cressie, 
1991, page 690). 

This result raises the question of the converse statement. Consider a point process A^i 
arising as a random thinning of a point process A^ and let A^2 be the complementary point 
process such that A^ = A^i + A^2- If A^i and A^2 are independent point processes, is it true that 
A^ is a Poisson point process ? The answer is positive and, since the Poisson process is the only 
point process with this property, this result gives a characterization of this process. 

This characterization result is not well known among spatial statisticians but it is not 
new. Srivastava (1971) proved this characterization for the particular case of stationary point 
processes evolving in time. He provided a short proof using two previous results: a Poisson 
distribution characterization by Moran (1952) and the characterization of a Poisson process by 
the Poisson distribution on compact sets by Renyi (1967). Fichtncr (1975) extended Srivastava's 
(1971) characterization theorem for non-stationary point processes occuring in W^. 

In this paper, we present a new proof of this characterization of Poisson processes, possibly 
non-stationary. We believe our proof is simpler than Fichtner's. It only uses Moran's theorem 
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and well known point processes results. We also present a new and elementary proof of Moran's 
theorem without using characteristic functions (Lemma 1 below). Since this important char- 
acterization theorem is absent even from major point processes references, such as Daley and 
Vere- Jones (1988), we think it will be useful for spatial statisticians to present it here. 

Based on this characterization theorem, we present a new approach to test for spatial pattern 
in an observed point process. Although the theorem characterizes also inhomogeneous Poisson 
processes, in this paper we concentrate on the detection of homogeneous Poisson process. We 
present two different tests, one based on the bivariate K function, and another based on empty 
space methods. 

We give the definitions and set the notation in Section 2 where we also prove our main 
result concerning the characterization of the Poisson process through the independence of the 

processes formed by randomly labeling an initial process. In Section 3, we discuss some impli- 
cations for statistical inference about homogeneous Poisson point processes and we finish with 
discussion and conclusions in Section 4. 



2 The characterization of a Poisson point processes 

Let be a point process in R"* with locally finite intensity u: ^{K) < oo for each compact set 
K. Let Ni and Ni independent thinnings of N with acceptance value p and 1 — p respectively, 
with p e [0, 1]. These processes are characterized by 

F{N,{K)=t,N2{K)=j)=F{N{K)=t+j)('^.Ap\l-py (1) 



for any compact set K. 



Theorem 1 The process N is a Poisson process if and only if Ni and N2 are independent. 

The proof of the theorem is based on an elementary lemma about Poisson random variables. 
Let Z he a random variable with values in N and finite mean A > 0. Let (f/j : i G N) be a 
sequence of independent random variables and independent of Z with Bernoulli distribution: 

P(C/i = l) = l-P(C/, = 0)=p (2) 

where p e [0, 1] is a parameter. 

Let X and Y be thinnings of Z using Uf 

z z 

X:^J2Ur, y:=^(l-C/0 (3) 

i=i i=i 
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By Wald identity 

EX = Xp; EY = A(l - p) . 



(4) 



Write Tk = P(Z = k),pk = P(X = k) and qk = P(F = k). Then, by definition: 





(5) 



and 




(6) 



Lemma 1 Tie variable Z has Poisson distribution if and only if X and Y are independent. 

Proof. The imphcation "Z Poisson imphes X and Y independent" is in textbooks (Cressie, 
1991, page 690, for instance). To show the reverse we first estabhsh the strict positivity of all 
r„. Since X and Y are independent, then 



By r„ > implies > and Pk > for all k < n. Since Z is not identically equal to zero, 
r„ > for some n > 1 and hence po, qo,pi, qi > 0. By Tq = Poqo > and ri > poQ'i > 0. By 
induction, fixing n > 1 and assuming r„ > 0, we get r„+i > > 0. This shows that r„ > 
for all n > 0. 

Using the hypothesis of independence and taking alternatively i = x, j=y + l and then 
i = X + 1 and j = y in (H) we get 




(7) 






from where 



PxQy+liy + i)p = Vx+iqy{x +i){i-p). 



(10) 



Fixing X = 0, ( |10|) and the fact that (g^) is a probability imply that must satisfy: 




(11) 



(12) 
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whose solution is: 



) 



e 



y >0 



(13) 




and Z = X + y, Z must be Poisson. □ 

Proof of Theorem 1. A point process is completely determined by the null probabilities 
{F{N{K) = : K compact) (Theorem 7.3. II in page 216 of Daley and Vere- Jones, 1988). 
Denoting Z = N{K), X = Ni{K) and Y = N2{K), we have that Z, X, Y satisfy the hypothesis 
of Lemma 1 with A = i^iK). Hence N is Poisson with intensity u. □ 

3 New tests for homogeneous Poisson point processes 

This characterization of the Poisson process suggests a different way to test if a point process is 
a stationary Poisson process. Assume X is a stationary process and, using a coin with success 
probability p, randomly label some of its events with mark 1, the remaining events being 
marked as 2. Only the stationary Poisson process has the two marked processes independent. 
Therefore, to test if the randomly labelled processes Ni and N2 are independent is equivalent 
to test the hypothesis that X is a stationary Poisson process. 

The usual way to test if two stationary processes observed in a finite sampling window A 
with area \A\ are independent is that proposed by Lotwick and Silverman (1982) based on 
conditional Monte Carlo tests (Ripley, 1977; Besag and Diggle, 1977) when A is a rectangle. 
Firstly, a suitable test statistic is chosen refiecting a particular alternative hypothesis of interest. 
If no specific alternatives are enviosioned, it is common to consider the bivariate Ripley's K 
function defined for (i > by 



where Aj is the first-order intensity of process Ni and Ai2('u) is the second-order intensity 
function of processes Ni and N2. It is clear that Ki2{d) = K2i{d). From the definitions, it 
follows that, under independence of Ni and A'"2, we have Ki2{d) = vrd^, whatever the marginal 
distributions of the two processes. 

Let rii and n2 be the number of events of Ni and N2, respectively, observed in the rectangular 
sampling window A. Convert the rectangle to a torus by identifying the opposite edges of A. 
With this toroidal idea, no edge correction is necessary in the definition of the estimator of 
Ki2{d). Define Id{u) to be 1 if m < d, and otherwise. Let Uij be the distance from the ith 
Ni-tjpe event located at Xn to the jth N2-tjpe event. 




(14) 
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The test statistic is based on the empirical function Kui^d), first proposed by Hanisch and 
Stoyan (1979), and defined by 

K^d) = in.n^r'lAlJ^J^Uu,,) (15) 

i=i j=i 

The equahty Ki2{d) = K2i{d) is also valid for its empirical counterparts Ki2{d) and K2i{d) in 
this case of a toroidal region. 

Keeping the A^^i process fixed, randomly shift the observed N2 pattern in the torus and recal- 
culate Ki2{d). After many independent shifts, we have the empirical distribution of Ki2{d) un- 
der independence of the processes conditioned on the observed marginal structure. Percentiles 
from this distribution for several different values of d can be used to construct acceptance 
envelopes for the hypothesis. 

In the procedure we are proposing, the A^^i events are chosen out of those from the process 
independently with probability p. To choose the value of p, consider the variance of (p!5[) . If Ni 
and are independent Poisson processes then, conditionally on the values of rii and n2, 

711 ri2 

Var{KM) = iri^n2)-M?Y. C-^^ U^K', )) (16) 

ni 712 

= (nin2)-'|A|2 J^V^ar(J,(M,,)) 
1=1 j=i 

= {nin2)~'^\A\War {l[ix-Y\<d]) 

where X and Y are independent random variables uniformly distributed over A identified with 
the torus (Silverman, 1978). If rii + n2 = n is fixed, the optimal choice of rii and n2 in the 
sense of minimizing the variance (|l^ is given by rii = n2 = n/2. This suggests labeling the 
processes with p = 0.5. 

Another possible test statistic is based on the avoidance set function P{N{A) = 0) or 
"empty space" techniques. There are examples of ergodic stationary dependent bivariate point 
processes that are judged independent by second-order methods, such as the Ki2{d) function, 
but with interactions detected by the avoidance function (Lotwick, 1984). This leads to the 
consideration of another test. 

Let Gi{d) be the probability that a disc of radius d contains no events of the A^i process. 
Define G2{d) and G{d) similarly for the processes A^2 and A^ = A^i + A^2, respectively. If A^i 
and A'2 are independent processes we have the following identity holding for all d: G{d) = 
Gi{d) G2{d). As a consequence, we can use the following statistic to investigate the interaction 
between the processes A^i and A'2: 

T{d) = logG(rf) - logGi(rf) - logG2(rf) (17) 
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As previously described, a conditional Monte Carlo test is used to assess the significance of 
empirical estimates of T{d). Lotwick and Silverman (1984) use the Green-Sibson Dirichlet 
tesselation algorithm for computing the function estimates while we prefer to estimate them 
from m randomly distributed sample points in A as decribed in Diggle (1983, page 20). 

To choose the value of p, consider the variance of (p^Tf). Assuming that is a Poisson process 
with intensity A and n observed events and ignoring boundary effects, we use a standard delta 
method argument (Taylor expansion) to find 

Var{T{d)) ^ i (e'^"'^' + 1 - (e^'^"'^' + e^^-^'^^^^^'^ . (18) 

It is clear that the variance is zero when j9 = or j9 = 1. The reason is that, in this case, 
T{d) = because either N = Ni or N = A'2. Since logG{d) is fixed whatever value of p 
is chosen, a better strategy is to select p to minimize the variance of the log{Gi{d) G2{d)) 
estimator. Hence, 

Var (logGi(rf) - logG2(rf)) ^ ^ (^(gP^A-d^ + e(i-p)2A.d2 _ 2^ ^ (^g) 

which is minimized when p = 0.5, giving a minimum of 2/?2(exp(7rA(i^) — 1) > 0, if (i > 0. As 
we found previously, this new result also suggests to label the processes using p = 0.5. 

Example 

We illustrate the techniques described with some real data: the locations of 62 redwood seedlings 
in a square of 23 meters, the locations of 42 biological cell centers in a unit square, and the 
locations of 65 Japanese black pine saplings in a square of side 5.7 meters. All the data are as 
reported by Diggle (1983) from the references therein. 

Figure 1 around here 

Figure 1 shows the results of our two tests, based on the Ki2{d) function and in the empty 
space function. The first, second and third columns of plots refer to the redwood seedlings, cell 
centers, and pine saplings, respectively. The first row of plots shows the three point patterns. 
We used p = 0.5 to generate the thinned processes showed as circles and crosses in Figure 1. 
The second and third rows of plots refer to the Kui^r) and T{d) tests, respectively. 

Several tests have been used previously in these datasets and usually they accept the hy- 
pothesis of a homogeneous Poisson process for the pine saplings, and reject this hypothesis for 
the clustered redwood seedlings pattern and the regularly spaced cell centers. Our tests find 
these same results as can be seen by the behavior of the observed Ki2{r) and T{d) functions 
with respect to the 95% confidence envelopes. Both test functions lie outside the envelopes for 
the first and second datasets and inside the envelope for the third dataset. 
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4 Discussion and conclusions 



A fundamental property characterizing the Poisson process is the independence of counts on 
disjoint areas. The characterization theorem presented in this paper suggests that independence 
of random partitions of events in the same area is also capable of characterizing the Poisson 
process. This is another justification for the usual labeling of a homogeneous Poisson process 

as complete spatial randomness (Digglc, 1983). 

A result related to this theorem is Raikov's theorem (see Daley and Vere- Jones, 1988, page 
31) which shows that if Z is a Poisson random variable expressible as a sum Z — X + Y oi 
independent nondegenerate, nonnegative random variables then X and Y are Poisson random 
variables. The present characterization theorem drops the hypothesis that Z has a Poisson 
distribution and shows that this is a consequence of the independence of X and Y if they are 
obtained through the thinning of Z. 

We used the characterization result to propose two tests, based on empty space and second- 
order methods, for the hypothesis that a point process is a stationary Poisson process. It has 
not been considered in this paper their relative power in detecting departures from the null 
hypothesis of a Poisson process and the kind of departure detected by the two techniques. 
Likewise, we have not considered the relative merits of other techniques such as that based on 
the K function of the single type N process. 

Although the characterization result is also valid for non-homogeneous Poisson point pro- 
cesses, it is not clear how this could be used to set up a hypothesis test in this case. The main 
problem in the non-homogeneous situation is the dependence of the thinned processes on the 
unknown first-order intensity function of N. Similar problems have made difficult the estima- 
tion of second moment functions for stationary Cox process (Chetwynd and Diggle, 1998). 

The examples in Section 3 demonstrate that tests for interaction between two complemen- 
tary processes obtained through the random thinning of a stationary point process can be an 
alternative to test the hypothesis that A^ is a stationary Poisson process. Both tests, that based 
on the Ki2 function and that based on empty space methods, lead to the same conclusions in 
the examples considered in this paper. These conclusions are the same reached using other 
tests previously proposed in the literature. 
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Figure 1: The first, second and third columns of plots refer to the redwood seedhngs, cell 
centers, and pine saplings datasets, respectively. The first, second and third rows of plots refer 
to the point patterns, the bivariate K functions and the T[d) statistic, respectively. The dashed 
lines are approximate 95% confidence bands. 
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